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ABSTRACT. We show that the Bayesian star paradox, first proved mathematically by Steel 
and Matsen for a specific class of prior distributions, occurs in a wider context including 
less regular, possibly discontinuous, prior distributions. 



Introduction 

In phylogenetics, a particular resolved tree can be highly supported even when the data is 
generated by an unresolved star tree. This unfortunate aspect of the Bayesian approach 
to phylogeny reconstruction is called the star paradox. Recent studies highlight that the 
paradox can occur in the simplest nontrivial setting, namely for an unresolved rooted tree 
on three taxa and two states, see Yang and Rannala [7| and Lewis et al. (TJ. Kolaczkowski 
and Thornton [ 2 1 presented some simulations and suggested that artifactual high posteriors 
for a particular resolved tree might disappear for very long sequences. Previous simulations 
in were plagued by numerical problems, which left unknown the nature of the limiting 
distribution on posterior probabilities. For an introduction to the Bayesian approach to 
phylogeny reconstruction we refer to chapter 5 of Yang [5 ]. 

The statistical question which supports the star paradox is whether the Bayesian poste- 
rior distribution of the resolutions of a star tree becomes uniform when the length of the 
sequence tends to infinity, that is, in the case of three taxa and two states, whether the 
posterior distribution of each resolution converges to 1/3. In a recent paper, Steel and 
Matsen [3| disprove this, thus ruining Kolaczkowski and Thornton's hope, for a specific 
class of branch length priors which they call tame. More precisely, Steel and Matsen show 
that, for every tame prior and every fixed e > 0, the posterior probability of any of the 
three possible trees stays above 1 — e with non vanishing probability when the length of 
the sequence goes to infinity. This result was recognized by Yang and reinforced by 
theoretical results on the posterior probabilities by Susko [4]. 

Our main result is that Steel and Matsen's conclusion holds for a wider class of priors, 
possibly highly irregular, which we call tempered. Recall that Steel and Matsen consider 
smooth priors whose densities satisfy some regularity conditions. 

The paper is organized as follows. In Section Q] we describe the Bayesian framework of 
the star paradox. In Section[2] we define the class of tempered priors on the branch lengths 
and we state our main result. In Section[3] we state an extension of a technical lemma due 
to Steel and Matsen, which allows us to extend their result. In Section we prove our 
main result. Section|5]is devoted to the proofs of intermediate results. In AppendixlAl we 
prove that every tame prior, in Steel and Matsen's sense, is tempered, in the sense of this 
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paper, and we provide examples of tempered, but not tame, prior distributions. Finally, 
in Appendix |B] we prove the extension of Steel and Matsen's technical lemma stated in 
Section [3] 

1. Bayesian framework for rooted trees on three taxa 

We consider three taxa, encoded by the set T = {1,2,3}, with two possible states. Phyto- 
genies on T are supported by one of the four following trees: the star tree Rq on three taxa 
and, for every taxon i in T, the tree Rj such that ; is the outlier. Relying on a commonly 
used notation, this reads as 

*! = (1,(2,3)), /? 2 = (2,(1,3)), J? 3 = (3,(1,2)). 

The phylogeny based on Rq is specified by the common length of its three branches, de- 
noted by t . For each i in T, the phylogeny based on Rj is specified by a pair of branch 
lengths (f e , fi), where f e denotes the external branch length and t\ the internal branch length, 
see figure Q] 

For instance, in the phylogeny based on R\, the divergence of taxa 2 and 3 occurred f e units 
of time ago and the divergence of taxon 1 and a common ancestor of taxa 2 and 3 occurred 
f; + t e units of time ago. 




I 2 3 2 3 1 3 1 2 1 2 3 



Figure 1 . The four rooted trees for three species. 

We assume that the sequences evolve according to a two-state continuous-time Markov 
process with equal substitution rates (which we may take to equal 1) between the two 
character states. 

Four site patterns can occur. The first one, denoted by so, is such that a given site coincides 
in the three taxa. The three others, denoted by s,- with i in T, are such that a given site 
coincide in two taxa and is different in the third taxon, which is taxon ;. In other words, if 
one writes the site patterns in taxa 1, 2 and 3 in this order and x and y for any two different 
characters, 

So = xxx, s\=yxx, S2 = xyx, and s^=xxy. 
Let {sq , si , S2 , S3 } denote the set of site patterns in the specific case described above of three 
taxa and two states evolving in a two-state symmetric model. Assume that the counting 
of site pattern s, is n,. Then n = «o + w i + w 2 + »3 is the total length of the sequences 
and, in the independent two-state symmetric model considered in this paper, the quadruple 
(«0,«i,«2,«3) ls a sufficient statistics of the sequence data. We use the letter n to denote any 
quadruple («o,«i ,«2,«3) of nonnegative integers such that |n| = «o + «i +«2 + «3 = n ^ 1. 
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For every site pattern si and every branch lengths (? e ,fi), let pi(t e ,ti) denote the probability 
that St occurs on tree Ri with branch lengths (t e ,ti). Standard computations provided by 
Yang and Rannala [7| show that 

4po(fe,fi) = l+e- 4 '=+2e-^+'=), 

4p 1 (f e ,f i ) = l+e- 4 '=-2e- 4 ^), 

4p2(t s ,ti)=4p3(t e ,ti) = 1 -e- 4 '=. 

Let % = (T e ,Ti) denote a pair of positive random variables representing the branch lengths 
(f e ,fi), and 91 = (Nq,Ni ,N2,N?,) denote a quadruple of integer random variables represent- 
ing the counts of sites patterns n = (riQ,n\,n>i,n-}). 

2. The star tree paradox 

Assuming that every taxon evolved from a common ancestor, the aim of phylogeny recon- 
struction is to compute the most likely tree To do so, in the Bayesian approach, one 
places prior distributions on the trees /?, and on their branch lengths 1 = (r e , 7j). 

2. 1 . Main result. Let P(9T = n\Ri,t) denote the probability that 91 = n assuming that the 
data is generated along the tree R[ conditionally on the branch lengths T — ( 7^ , 7] ) . One 
may consider R\ only since, for every n = («o,«i,«2,H3), the symmetries of the setting 
yield the relations 

P(91 = n\R 2 ,1) = P(«Jt= (no,n 2 ,n 3 ,ni)|tf 1 ,I), 

and 

F(m = n\R3,1) =P(9t= (n ,W3,rai,/i 2 )|/?i,T). 
Notation 2.1. For every site pattern su let Pi denote the random variable 

P,=p l (T)= P i(T e ,T i ). 
For every i in X and every n, let IT, (n) denote the random variable 
n,(n) = P^P^'P^, with {i,j,k} = x. 

We recall that P 2 = Pj, and we note that, if |n| = «q + n\ + n 2 + «3 =n with n ^ 1, then, for 
every i in x, 

H i {n)=P n (l P n l 'P' 2 1 - n °- n '. 

Fix n and assume that |n| = no + "i +«2 +«3 = n with n ^ 1. For every i in T, the posterior 
probability of Ri conditionally on 91 = n is 

P(*,|9t = n) - 1 E(n,-(n)). 

no'«i'«2'«3' P(9l = n) 

Thus, for every 2 and y in t, 

P(gfjgt=n) _ E(n,-(n)) 
¥{R j \m = n) E(n ; -(n))' 

For every e > and every i in x, let JV^ denote the set of n such that, for both indices j in 
x such that j ^ i, 

E(n,-(n)) > (2/e)E(n»). 
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One sees that, for every i in T and n in jVf ', 

F(Ri\yt = n) ^ 1 - e, 

which means that the posterior probability of tree among the three possible trees is 
highly supported. 

Recall that, under hypothesis Rq and for a tame prior distribution on T = (T e , 7]), Steel and 
Matsen prove that, for every i in t, P(9T € ^(f £ ) does not go to when the sequence length 
n goes to infinity, and consequently that the posterior probability P(/?,|9 r t) can be close to 
1 even when the sequence length n is large. 

As stated in the introduction, our aim is to prove the same result for tempered prior distri- 
butions of 1 = (T e ,Ti), which we now define. 

Notation 2.2. (1 ) For every s € [0, 1] and z € [0, 3], let 

G{z, s ) = P (e- 47i ( 1 - e- 4r < s \ eT* T * ( 1 + 2^ ) = z) . 

(2) For every positive t and every site pattern Sj, let qi denote the probability that st occurs 
on tree Rq, hence 

4q = 4po(0,f) = 1 + 3e- 4 ', Aq x = 4q 2 = 4q 3 = 1 - e" 4 '. 

(3) Let t t denote a positive real number such that 1 < Aq$ — l t and Aqo + £ t <4,for instance 
l t = 3e~ 4 ' (1 — e~ 4 '). Let I and I t denote the intervals 

I = [0, 3] , /, = [4q -1-4,4^-1+4] C]0, 3 [. 

(4) For every positive t and integer n, let 

Q„(t)=F(Ti^l/n,t^T s ^t+l/n). 

Definition 2.3 (Tempered priors). The distribution ofl = (T e ,T{) is tempered if the fol- 
lowing two conditions hold. 

(1) For every t, there exists a real number sq in ]0, 1], an interval I t around 4qa — 1, 
some bounded functions Ft, some positive numbers (X and K, an integer k 1 and 
some real numbers such that 

= 6o < £i < • • • < e fc _i < 2 < e k , 

and such that for every s in [0, So] an d every z in I t , 



G(z,s)- k fF(z)s a+£ ' 

i=0 



S$ Ks a+Ek . 



(2) For every positive t, n \ogQ n (t) — > when n — > °°. 

We detail the properties involved in Definition ^ . 3 l and provide examples of tempered priors 
in subsectionl2.2lbelow. 



We now state our main result, which is an extension of Steel and Matsen's result to our 
more general setting. 

Theorem 2.4. Consider sequences of length n generated by a star tree Rq on 3 taxa with 
strictly positive edge length t. Let 9^ be the resulting data, summarized by site pattern 
counts. Consider any prior on the three resolved trees {R\,R2,Ri) which assigns strictly 
positive probability to each tree, and a tempered prior distribution on their branch lengths 

s = (r e ,7i). 
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Then, for every i in T and every positive £, there exists a positive 8 such that, when n is 
large enough, 

P(P(J&|*K)>l-e)>5. 
We prove Theorem l2.4l in Section|4] 

2.2. Motivation and intuitive understanding of Definition 12.31 In Definition 12.31 con- 
dition |2] is easy to describe, to illustrate and to check, while the content of condition Q] 
might be more difficult to grasp. Condition Q] involves a Taylor expansion around s = of 
the conditional cumulative distribution function s H> G(z,s), where the Taylor coefficients 
depend on z. Such a Taylor expansion roughly describes the prior distribution when t ; — > 
and when ? e is roughly constant. The precise definition of G(z, •) and the technical result 
stated in Proposition 13.21 are both dictated by our approach to the proof of Theorem 12.41 
A key hypothesis is that £o = while £<. > 2, which means that we are given a limited 
expansion of s H > G(z, s) up to a better order than s 2 when s — >• 0. 

At this point, the reader can wonder how to check if a given prior is tempered or not and 
if the verification is simply possible in concrete cases, given the convoluted aspect of this 
definition. Hence we now present some explicit examples of tempered priors. We begin 
with the following result. 

Proposition 2.5. Assume that T = (T e , 7j) has a smooth joint probability density, bounded 
and everywhere non zero. Then the distribution ofH = (T e , Tj) is tempered. 

As a consequence, every tame prior fulfills the hypothesis of Proposition ^. 51 hence every 
tame prior is tempered, as claimed in the introduction. This case includes the exponential 
priors discussed in [7]. We prove Proposition ^. 5l in Appendix lAl 

However some tempered priors are not tame, as illustrated by the following example where 
Steel and Matsen's condition fails. 

Definition 2.6. Let a > and b > 0. Let (t n ), (y n ) and (r n ) denote sequences of positive 
numbers, indexed by n 1 and defined by the formulas 

t n =n-\ y„ = l+2e- 4 '« r n = y„ (V fe - (n + 1)-*) . 

Finally, let 

Proposition 2.7. In the setting of Definition \2.6\ assume the following: 

(i) 3a < min{l,fo}. 

(ii) The random variable T[ is discrete and such that, for every n 1, 

P(7j = tn) = r n /r. 

(iii) The random variable T e is continuous, independent of 7], with exponential law 
of parameter A, that is, with density 4e~ 4 ' on t with respect to the Lebesgue 
measure. 

Then, the distribution ofE = (T e ,Ti) is not tame but it is tempered, for the parameters 
k = 3, a = b/a, £\ = 1 , £2=2, £3 = 3. 
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Since the distribution of 7; is an accumulation of Dirac masses, the prior distribution of 
T = (T e ,Ti) cannot be tame. 

Yet, the fact that the prior distribution is tempered does not come only from the fact that 
the distribution of 7j is discrete. For a degenerate example, if 7| — almost surely, then 
G(z,s) = 1 for every s ^ 0, and G(z, •) has no Taylor expansion around zero whose first 
term is a positive power of s. Note that in this particular case, the Bayesian star paradox 
does not occur. 

However, under the conditions of Proposition ^. 71 G(z, •) has a Taylor expansion at ful- 
filling condition[T]of Definition 12. 3 1 We prove this in Appendix lAl 

We provide below some examples of less ill-behaved distributions which are tempered but 
not tame, and an example of a distribution which does not fulfill condition [T] hence is not 
tempered. 

Proposition 2.8. Assume that T e is a continuous random variable, with exponential law 
of parameter 4, that is, with density 4e~ 4 ' on t ^ with respect to the Lebesgue measure, 
and that T\ is a random variable independent ofT e . Then, the following holds. 

(i) If the distribution ofT\ is uniform on [0, 9], with 9 > 0, the distribution of '% = 
(T e ,T[) is tempered but not tame. 

(ii) If the distribution of 7] has density 9t®~ 1 on the interval [0, 1], for a given 9 in 
(0, 1), the distribution of% = (T e , 7]) is tempered but not tame. 

(iii) If the distribution ofT, has density log(l jt\) on the interval [0, 1], the distribution 
ofH = (T e ,T{) is not tempered. 

(iv) If the distribution ofT\ has density 4f;log(l jt\) on the interval [0, 1], the distribu- 
tion ofl = (T e , 7j) is not tempered. 

Note that in case (iv), the density function of T = (T e ,T{) is bounded, non smooth but 
continuous, but the distribution is not tempered. 

We prove Proposition ^. 8l in AppendixlAl 



3. Extension of Steel and Matsen's lemma 



The Bayesian star paradox due to Steel and Matsen relies on a technical result which we 
slightly rephrase as follows. For every nonnegative real t and every [0, 1] valued random 
variable V, introduce 

M, = E(V ), R, = l — = v , , ' ■ 

Proposition 3.1 (Steel and Matsen's lemma). Let ^ f] < 1 and B > 0. There exists a 
finite K, which depends on t] and B only, such that the following holds. For every [0, 1] 
valued random variable V with a smooth probability density function f such that /( 1 ) > 
and |/'(v)| ^ Bf(\) for every 7] $J v ^ 1, and for every integer k ^ K, 

2kR k > 1. 

Indeed the asymptotics of when k is large depends on the behaviour of the distribution 
of V around 1 . 



Our next proposition proves that the conclusion of Steel and Matsen's lemma above holds 
for a wider class of random variables. 
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Proposition 3.2. Let V a random variable on [0, 1]. Suppose that there exists an integer 
n 1 and real numbers ^ vo < 1, 05 > 0, £; and y, such that 

= £o < £i < • ■ ■ < £„-i 1 < £„, 

and, for every vo v ^ 1, 

P(V^v)-£^(l-v) a+e ' 

1=0 

Then there exists a finite 9{y), which depends continuously on y — (yo, . . . , y n ), such that 
for every t 9{y), 

2tR, ^ a. 

Remark 1. We insist on the fact that 9{y) depends continuously on the multiparameter 
Y = (%!•••>%)■ To wit, in the proof of Proposition 15.61 we apply Proposition 13.21 with 
bounded functions of z. This means that for every z in I t , one gets a number 9 which 
depends on z through the bounded functions such that the control on the distribution ofV 
holds. The continuity of 9 ensures that there exists a number independent of z such that 
Proposition ^ .6\ holds. 

Remark 2. If one computes a Taylor expansion of the function v 1— > P(V ^ v) at v = 1 ~~ un- 
der the conditions of Steel and Matsen's lemma, one sees that conditions of Proposition^^ 
hold. Hence Proposition \3. 21 is an extension of Steel and Matsen's lemma. 

We prove Proposition l3.2l in Appendix IE1 The proof of Theorem l2.4l relies on it. 



<y„(i-v) 



a+£„ 



4. Synopsis of the proof of Theorem|2.4 



This section is devoted to a sketch of the proof of Theorem 12.41 We use the definitions 

in) 

below. Note that the set F c - defined below is not the set introduced by Steel and Matsen. 
For a technical reason in the proof of Proposition l4.2l stated below, we had to modify their 
definition. Note however that Propositions 14.21 and |4.3| below are adaptations of ideas in 
Steel and Matsen's paper. 

Notation 4.1. Define functions A,- as follows. For every nonnegative integers n = («o> w i> w 2j w 3) 
such that I n | = «o + n i +«2 + n 3 = n w ' m n 1> 

Ao(n = p=— , 

V" 

and, for every i in X, 

A/ (n) = n '~ 1/3 ^~ no) . 

For every c > 1, introduce 

F^ n) = {n; |n| = n, -2c < A 2 (n) < -c, -2c < A 3 (n) < -c, -c < A (n) < 0}. 
For every i in T and every positive r\, let A'^ denote the event 

4 = {Vj € T, ./ ^ i, E(n,-(A0 |JV) > i7E(n y -(JV) |JV)} . 
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Since A] + A2 + A3 = 0, every rt in F^ is such that 2c ^ A[ (n) ^ 4c. We note that Fc is 
not symmetric about T and gives a preference to 1 . That is why we only deal with in 

the following proof. To deal with A^, one would change the definition Fc accordingly. 

From the reasoning in Section [2] it suffices to prove that for every positive 77, there exists 
a positive 8 such that, when n is large enough, 

Suppose that one generates n 1 sites on the star tree Rq with given branch length t and 
let 91 = (Nq,N\ ,N2,Ns) denote the counts of site patterns defined in Section[T| hence No + 

N { + N 2 +N 3 =n. 

From central limit estimates, the probability of the event is uniformly bounded 

from below, say by 8 > 0, when n is large enough. Hence, 

We wish to prove that there exists a positive a independent of c such that for n large enough 
and for every n in F^ and for j = 2 and j = 3, 

E(ni(n)) >c 2 aE(nj(n)). 
This follows from the two results below, adapted from Steel and Matsen's paper. 

Proposition 4.2. Fix t and assume that n is in fJ \ Then, when n is large enough, for 
j = 2 and j = 3, 

E(Uj(n) \4P - 1 €l t ) > E(n» |4P - 1 ^ /,)• 

(n) 

Proposition 4.3. Fix t and assume that n is in Fc . Then, there exists a positive a, inde- 
pendent of c, such that for every z in I t , and for j = 2 and j = 3, 

E(ni(n) |4F -l=z)> c 2 aE(n,(n) |4F - 1 = z). 

We prove Propositions 14. 2l and l4.3l in Section[5] 
From these two results, for j = 2 and j = 3, 

E(ni(n)) ^c 2 aP(4P -l e/r)E(n 7 -(n)). 

Assume that c is so large that c 2 aP(4Po — 1 £ h) ^ I?- Then, for every n in fJ"' and for 
j = 2 and /' = 3, 

E(n 1 (n))^T ? E(n,(n)). 
This implies that P ^Ajj | 9t e Ff"^ = 1 , which yields the theorem. 

5. Proofs of Propositions 14. 2I and 14. 31 

5.1. Proof of Proposition 14.21 The proof is decomposed into two intermediate results, 
stated as lemmata below and using estimates on auxiliary random variables introduced 
below. 
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Notation 5.1. For every n > 1 and t > 0, let T t {n) = [0, 1/n] x [?,? + 1 /«]. 

For every t > 0, let /I, — q^q^q^q 1 ^ = qf^i 11 an d Ut denote the random variable 

u t =fi(p,/ q ,r. 

i=0 

For every n and for j — 2 and j — 3, Zef W)(n) denote the random variable 

Wj(n) = pAo(n)p(A,-Ao/3)(n) jp (A 1+ A,-2A /3)(„) ; ^ {j . fe} = {2;3} 

One sees that 

u, = C^P^/fr, Quit) = P(1 € r,(n)) , 

and, for /' = 2 and /' = 3, 

W/ = (fl)/ft) Ao (A/ft) Ay-V3 - 

Lemma 5.2. (7) For every n in Fc and for j — 2 and j = 3, W/(n) ^ 1. 

(2) For every n /« F e and for j — 2 and j = 3, W ; (n) (oj ) £ on f/ie even? {T € r f («)}. 
(3j r/iere exists a finite constant K such that U" ^ e~ K uniformly on the integer n ^ 1 and 
on the event {2" £ T t (n)}. 

Proof of Lemma\5l\ (1) For every T, F > Pi > Pi- On F c {n) , A < and for j = 2 and 
j = 3, Aj — Ao/3 < hence 

(Po/Pt^^l, (P Q /P 2 ) A J - A ^^L 

This proves the claim. 

(2) One has Pq ^ 1 everywhere and P\^ q\ and P 2 ^ q\ on the event {1 E F t (n)}. On 
F c (n) , A s$ and for j = 2 and j = 3, A,- - A /3 sC hence Wj > q 2 A ^ 2A ° /3 . Finally, on 
Fc , A/ + 2Ao/3 ^ — c. This proves the claim. 

(3) For every 1 in T t (n), one has 7] ^ and r e f, hence Fi ^ q\ and P 2 ^ q 2 = q\- 
Likewise, 7| ^ 1 /« and T e ^ f + 1 jn hence 

F ^ P0 (l/n,t+l/n) > q - 5e- 4 '(l - e - 4 /")/4. 

This yields that, for every n ^ 1 and every 1 in r r (n), 

^ (1 -5e- 4t /(q n))" -> e X p(-5e- 4 '/ ? o) > 0, 

which implies the desired lower bound. □ 

Lemma 5.3. For every n in F c and for j = 2 and j = 3, 

E(rL;(n) |4F - 1 G /,) ^ Mr"G«(0«P(-O(Vn)), 

and 

E(n y (n) |4F - 1 i I t ) < vfexp(-n#/32). 

Proof of Lemma [Ol Since Fo = po(T), for every 2 in T,(n), when n is large, 4Fo — 1 is in 
the interval 7, . Consequently, 

E(n ; -(n)|4Fj-i e/,) ^e„(f)E(n ; -(n)|Ter r («)). 

On the event {T € T t (n)}, 
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from parts (2) and (3) of Lemma I5T21 which proves the first part of the lemma. 

Turning to the second part, let &kl denote the Kullback-Leibler distance between discrete 
probability measures. When APq — 1 is not in I t , 

d KL (q,P) > (1/2)|| 9 -P||? > (l/2)(q -Po) 2 >l 2 /32. 

Note that 

n y (n) = ^Wj{n)^tsKp(-ndKL(q,P)), 
hence the estimate on dKL(q,P), and part (1) of Lemma IBT21 imply the second part of the 
lemma. □ 

Turning finally to the proof of Proposition 14.21 we note that Q n (t) = because the 
distribution of X is tempered. Furthermore, Lemma l5~3l shows that, when n is large enough, 

E(n» 1 4P - 1 e I t ) > E(n y -(n) 1 4P Q - 1 f I,), 

and this concludes the proof of Proposition ^. 21 



5.2. Proof of Proposition 14.31 Our proof of Proposition |43] is based on Lemma IB31 and 
Proposition l5.6l below. 

Notation 5.4. For every u in [0, 1], let t,{u) = (1 + 2m) (1 — u) 2 . Let U and V denote the 
random variables defined as 

u=(p 1 -p 2 )/(i-po), v = au). 

In) 

Lemma 5.5. For every n in Fc and for j = 2 and j = 3, 

Egli(n)jifr) 2 E(V>(1-V)\P Q ) . , ... 

„,„ , , . — - ^ 4c n — : — : . where s — (n — no) 3. 

E(n y (n)|/b) K(V S \P Q ) 1 K " 



Proof of Lemma 1331 Recall that, for every c > 1, F c v is 

F c (n) = {n : |n| = n, -2c < A 2 (n) < c,-2c < A 3 (n) < c-c < A (n) «S 0}. 
Using the A variables, one can rewrite TIi, IT2 and II3 as 

U i (n)=P^(PiPiY(P l /P 2 ) Ai ^, i=W,s=(n-n )/3. 

(n) 

Assume that n is in F c . Then, Ai (n) ^ 2c, Aj(n) for j = 2 and j = 3, and P\ ^ P2. 
Hence 

n l (n)>p" °(p l p 2 2 y(p l /p 2 ) 2c ^, n;(n) ^ppfrpiy. 

Furthermore, 

P X P 2 = (1/27)V(1 -Po)\ P1/P2 = (1 +2E/)/(l -E/), 
hence for j = 2 and j = 3, 



m(n)\Po) ®[V°((l+2U)/(l-U)r^\P 



E(rL»|P ) E(V S \P ) 
Direct computations (or Lemma 3.2 in Steel and Matsen [3 |) show that, for every u in [0, 1) 
and every m ^ 3, 

((l+2«)/(l-«))"^m 2 (l-C(«)), 

hence 

((1 +2U)/(l - U)) 2cV " > 4c 2 n (1 - V). 
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The conclusion of Lemma 1531 follows . □ 



Proposition 5.6. Assume that the distribution ofT is tempered. Then there exists 9 and a, 
both positive and independent of c, such that for every s ^ 9, on the event {4Pq - 1 6 /(}, 

4sE(V s (l-V)\P )^aE{V s \P ). 



Proof of Proposition \5.6\ We recall that U and V denote random variables defined as 

U=(P l -P 2 )/(l-P ), V = C(U), C(«)-(1+2«)(1-m) 2 . 



To use Proposition l3.2l one must compute a Taylor expansion at v = 1 or, equivalently, at 
u = + , of the conditional probability 

P(V^v\P )=F(U ^u\P ), 

where u = £ _1 (v). Besides, for v close to 1, 

u = £ -1 (v) =w/V3 + w 2 /9 + 5w 3 /54-V3 + 0(w 4 ), with w = a/I"^. 

Since U = (Pi -P 2 )/(l-Po), 

P(C7 s; M |4P -l =z) =P(S e (3-Si) ^2s\S e Si = z), 

where we used the notations 

S e ^ e - 4T % S i = l+2e- 4T ' 1 2s = u(3-z). 

Using Definition ^. 31 one has 

G(z, s)=¥ (S e (3 - Si) < 2s | S e Si = z) . 

Since the distribution of 1 is tempered, there exists some bounded functions Ft defined on 
It, a positive number a,n + l real numbers 

= £o < £i < • ■ ■ < £„_i 2 < £„, 

and two positive numbers k and so such that for every ^ s ^ So an d every z in 

G(z,s)-£Fi(z> a+ei ^™ a+£n - 

i=0 

Combining this with the relation 2s — u(3 — z) and the expansion of u — £~ 1 (v) along the 
powers of w, one sees that there exists some bounded functions /, on a positive number 
K 1 and ^ vo < 1 such that for every vo ^ v ^ 1 and every zE It, 

n-l 

P(V > v\4P - 1 =z) - £ /j(z)(l - y) a / 2+£ '/ 2 < jc'(1 - y) a/2+£ " /2 . 
i=0 

Since the functions /, are bounded and positive on Proposition 13.21 implies that there 
exists a positive number 9 such that for every z in I t and every s ^ 9, the conclusion of 
Proposition l5.6l holds. □ 

Assuming this, the proof of Proposition l4.3l is as follows. Let s, 9 and a be as in Lemma l531 
and Proposition l5.6l Since n — «o = (1 — qo)n — Aoy/n ^ (1 — go)« for every n in F c , one 
knows that s = (n — «o)/3 ^ 6 when n is large enough. Furthermore, s ^ n/3. Finally, for 
every n in Fc with n large enough, on the event {4Pq — 1 G /,} and for y = 2 and j = 3, 

E(n,(n)|^) ^a^oECn/n)!^,). 

This concludes the proof of Proposition l4.3l 
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Appendix A. Proof of Propositions 12. 51 1 2.71 and I2.8I 

Notation A.l. Introduce the random variables 

(S e ,Si) = g(T e ,Ti), where g(t e , ti ) = {^.1+2^), 

that is, 

S e = e - 47i , Sj = l+2e- 47 \ 

Hence, G(z, •) is defined by 

G(z,s)=P(3S e ^2s + z\S e Si=z), 

A.l. Proof of Proposition 12.51 The distribution of (5^,5^) has a smooth joint probability 
density, say GJ, defined on the set < x ^ 1 < y ^ 3 by 

. , (Oog- l (x,y) 

°m = i4-i) • 

For tame priors, the probability Q n (t ) introduced in condition|2]of Definition ^. 3l is of order 
1 /n 2 , hence this condition holds. 

The definition of G(z,s) as a conditional expectation can be rewritten as 

G(z, s) = P (3S e < 2s + S e Si | S & Si = z) . 
Hence, for every measurable bounded function H, 

E (H(S&) ; 3S e < 2s + S e S { ) = E (H(S&)G(S&,s)) , 

that is, 

J J H(xy)l{3x^2s+xy}®(x,y)dxdy= 1 1 H(xy)G(xy,s)Cj(x,y)dxdy. 
The change of variable z—xy yields 

Jj H(z)l{3x ^2s + z}®(x,z/x)dzdx/x = Jj H{z)G{z,s)W (x,z/x)dzdx/x. 
This must hold for every measurable bounded function H, hence one can choose 

G(z,s)=H(z,s)/H(z,°°), 
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with 

H(z,s) = J l{3x^2s+z}tS(x,z/x)6x/x. 

Since 0^S e ^l^Sj^3 almost surely, the integral defining H(z,s) may be further re- 
stricted to the range ^ x ^ 1 and z/3 ^ x ^ z. Finally, for every s ^ and z £ [0, 3], 

G(z,s)=H(z,s)/H(z,l), 

where 

rm(s,z) 

H(z,s) = / 07 (x,z/x)6x/x, withm(i,z) =min{l, z,(2s + z)/3}. 

Jm(0,z) 

Hence, m(Q,z) = z/3 and, for small positive values of s, m(s,z) = m(0,z) +2s/3. When 

^ z ^ 1, m(s,z) —> m(°°,z) = z when s — >• °° and this limit is reached for s = z. When 

1 ^5 z ^ 3, m(s,z) —> m(°°,z) = 1 when s^m and this limit is reached for s = (3 — z)/2. 
In both cases, m(°°,z) = m(l,z) hence H(z,°°) — H(z, 1). 

Because co and g^ 1 are smooth, the Taylor-Lagrange formula shows that, for every s ^ 
and every fixed z, 

fffe*) =//(z,0)+//'(z,0> + i/"(z,0)-+i/( 3 )(z,0)- + jf (jt- ff ) 3 ffW( z ,j)g, 

where all the derivatives are partial derivatives with respect to the second argument s. 

Simple computations yield H(z,0) = and the values of the three derivatives H'(z,0), 
H"(z,0) and H^(z,0) as combinations of CO and of partial derivatives of co, evaluated at 
the point (#,0), where 3e~ 4 * = z. 

Furthermore, the hypothesis on co ensures that (z, •) is bounded, in the following sense: 
there exist positive numbers so and Kq such that for every s in [0,sq] and every z in 

# (4) (z,s) ^24k . 
Hence, 1 = (T e , 7j) fulfills the first condition to be tempered, with 

k = 3, a = l. £i = l, £2 = 2, £3 = 3, k=Kq, 
and, for every ^ 1 : ^ 2, 

^■(z)=// (i+1) (z,0)///(z,l). 
Finally, since CO is smooth, the functions Fi are bounded on 7, . 

A.2. Proof of Proposition 12.71 Recall that, using the random variables S e = e~ 47 = and 

S{ = 1 +2e~ 4r ', the function G is characterized by the fact that, for every measurable 
bounded function H, 

E{H(S e Si) : S e (3-Si) sC 2s) =E(H(S e Si)G(S e Si,s)) . 

Here, S e and Si are independent, the distribution of 5 e is uniform on [0, 1] and the distribu- 
tion of Si is discrete with 

P(Si=y„)=r„/r. 

Thus, 

H{xy n )\{x{3 —y„) < 2s}dx = Y r„ / H{xy n )G{xy n ,s)Ax. 

n JO 

The changes of variable z = y n x in each integral yield 

£- [H(z)l{z^y n }l{3z^(2s + z)y n }dz = Y<- [ H(z) l{z ^ y„}G(z,s)dz. 
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This must hold for every measurable bounded function H, hence 

G(z,s)=H(z,s)/H(z,«>), H(z,s)=Y,(rn/y n )Hz^y„}l{3z^(2s + z)y n }. 

n 

Since r n /y n = rT b — (n + l)~ h for n ^ 1, H(z,s) — n(z 7 s)~ b where 

n(z,s) =inf{n ^ 1 |z<y„, 3z sj (2s + z)y„}. 

Since y„ — >• 3 when « — >• oo, n(z, i) is finite for every z < 3 and s > 0. 

For every z > 0, when s is large enough, namely s ^ (3 — z)/2, the condition that 3z ^ 
(2s + z)y„ becomes useless and 

n(z,s) = inf{n ^ 1 |z<y„}, 

hence n(z,s) and H(z,s) are independent of s. If z ^ 1, this implies that n(z,s) and H(z,s) 
are independent of s ^ 1 . If z < 1 and s ^ 1 , the conditions that z ^ y„ and 3z ^ (2s + z)y n 
both hold for every n ^ 1 hence n(z,s) = 1 and H(z,s) = 1. In both cases, H(z,°°) — 
H(z,l). 

We are interested in small positive values of s. For every z < 3, when s is small enough, 
namely s ^ (3 — z)/2, the condition z ^ y n becomes useless and 

n(z,s) =inf{n ^ 1 |3z^ (2s + z)y n }, 
When furthermore s < z, n ^ n(z,s) is equivalent to the condition 

n- a ^h(s/z), with h(u) = --\n( \ — |,0^i/<l. 

w ' w 4 V 1 + 2 « / 

Finally, for every s < min{z, (3 — z) /2}, n(z, s) is the unique integer such that 

n(z,s)-l<h(s/zy l/ " ^n{z,s). 

This reads as 

h{u) b l"[\ +h(u) x l a ]- b < H(z,l)G(z,s) ^ h(u) b ' a , u = s/z. 
One sees that the function h is analytic and that h(u) — (3w/4) +o(u) when u — > 0, hence, 

h(u) b / a = (3u/4) b / a (l+a 1 u + a 2 u 2 +a 3 u 3 +o(u 3 )), 

when u — > 0, for given coefficients a\, a 2 and aj,. Likewise, since \/a>3,h{ufl a = o{u 3 ) 
when u — V 0. This implies that 

hence 

H(z,l)G(z,s) = (3u/4-) b/a (l+a l u + a 2 u 2 + a 3 u 3 +o(u 3 )). 
This yields the first part of Definition ^. 31 with 

k = 3, a = b/a, (ei,e 2 ,e 3 ) = (1,2,3), 

and 

F (z) = (3/4z) b/ "/H(z,l), F x {z) = a x F {z)/z, F x (z) = a 2 F Q (z)/z 2 . 
The remaining step is to get rid of the dependencies over z of our upper bounds. For 
instance, the reasoning above provides as an error term a multiple of 

u a+3 /H{z, 1) = s a+3 /{z a+3 H{z, 1)), 

instead of a constant multiple of s a+3 . But inf/, > 0, hence the l/z a+3 contribution is 
uniformly bounded. 
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As regards H(z, 1), we first note that H(z, 1) = 1 if z ^ 1. Ifz^ 1 , elementary computations 
show that H(z,l) ^ c if and only if n(z, 1) ^ c" 1 / 6 if and only if exp(-c a / fo ) > (z - l)/2, 
which is implied by the fact that 1 — c"l b ^ (z — l)/2, which is equivalent to the upper 
bound c"l b < (3-z)/2. Since sup/ r < 3, this can be achieved uniformly over z in /, and 
l/H(z, 1) is uniformly bounded as well. 

Finally, we asked for an expansion valid on s ^ sq, for a fixed so, and we proved an expan- 
sion valid over s/z ^ mo, for a fixed «o- But one can choose sq = UQinfI t . This concludes 
the proof that the conditions in the first part of Definition ^. 3l hold. 

We now prove that the second part of Definition ^. 3 I holds. Since 7] and T e are independent, 
for every positive integer n, 

Q n (t)=F(T i ^l/n)¥{t^T e ^t+l/n). 

One has 

nP (t ^ T e ^ t + 1 In) 4e~ 4 ' when n +°°, 

and 

1 3 

— n rr s^P 7i< 1/n K — rr- 

r(n l / a + l) b K ' ' rn b l" 

Since Q n (t) is bounded from below by a multiple of 1 /n l+b ' a , the second point of Defini- 
tion [23] holds. 



A.3. Proof of Proposition 12.81 Recall once again that, using the random variables S e = 
e~ 4Te and Si = 1 + 2e~ 47 i, the function G is characterized by the fact that, for every mea- 
surable bounded function H, 

E(H(S e Si) : S e (3 -Si) < 2s) = E(H(S s S i )G(S e S i ,s)) . 

Case (i). Here, S e and Si are independent, the distribution of S e is uniform on [0, 1] and Si 
is a continuous random variable with density 

JS5 L_ J i { i + 2.-<.o} 

with respect to the Lebesgue measure. Let GJ denote the joint probability density defined 

as 

®(x,y) = l{0^x^ l}l{l+2e- 4e s$yO} ' 



40(y-l) 
Thus, 

JJ H(xy)l{3x s^2s +xy}(D(x,y)dxdy — JJ H(xy)G(xy,s)(B(x,y)dxdy. 
The change of variable z—xy yields 

JJ H(z)l{3x ^2s + z}Q(x,z/x)dzdx/x = JJ H(z)G(z,s)® (x,z/x)dzdx/x. 
This must hold for every measurable bounded function H, one can choose 

G(z,s)=H(z,s)/H(z,°°), 

with 

H(z,s) = J l{3x^2s + z}l{0^x^l}l{l+2e- 4e ^z/x^3}dx/(z-x). 
Finally, for every s ^ and z in [0, 3], 

G(z,s)=//(z,s)///(z,l+e- 49 ), 
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where 

H{z,s)= , withm(.s,z) =min{l,z/(l+2e- 49 ),(2 i + z)/3}. 

Jm(0,z) Z-X 

Hence, m(Q,z) = z/3 and, for small positive values of 5, m(s,z) = m(0,z) + 2s/3. When 
^ z ^ 1 + 2e~ 4e , m(s,z) — > m(°°,z) = z/(l + 2e _4e ) when s—> °° and this limit is reached 
for s — ^2 e -48 z - When 1 + 2e~ 49 ^ z ^ 3, m(s,z) — > m(°°,z) = 1 when s — > °° and this 
limit is reached for s = (3 — z)/2. In both cases, m(°o,z) —m(l +e~ 49 .z) hence H(z 7 °°) = 
H(z,l+e- 46 ). 

1 +e -49 

For every fixed 0^z€ 1 + 2e~ 4e and every ^ s ^ j^z, 

^ l+2e- 4e 

z 



ff(z,*)=log 

For every fixed 1 + 2e~ 49 ^ z ^ 3 and every 0^5^ 

H(z,s)= log 



z — s y 

3-z 



2 

z 



Hence, there exists a positive so suc h that for every z in I t and every 5 in [0,*o]> 



H{z,s)=\og\^— J =log( 1- 

Such a function has a Taylor expansion around 5 = with uniformly bounded coefficient 
over z in Hence, X — (T e ,T\) fulfills the first condition to be tempered. 

We now prove that the second part of Definition ^. 3 I holds. Since 7] and T e are independent, 
for every positive integer n, 

Q„(?) = P(7i< l/«)P(r <r e <f + l/n). 

One has 

nP(f <7;^f+l/«)^4e~ 4 ' when n -)• +°°, 

and 

P(7; ^ l/«) = — , when n is large enough. 
9n 

Since Q„ (f ) is bounded from below by a multiple of 1 / n 2 , the second point of definition ^. 31 
holds. 

Case (ii). Here, S e and S; are independent, the distribution of S e is uniform on [0, 1] and 
is a continuous random variable with density 



6 



1, 1 
-log 



-l 



l{l+2e- 4 ^5; <3} 



4(5,-1) 

with respect to the Lebesgue measure. One can choose 

G(z,s)=H{z,s)/H{z,°o) 

where 



rm(s,z) 

H(z,s) = \ 



-1, fz-x 

— log 

4 B \ 2x 



1 Ax 



z — x 



with 

m(5,z) =min{l,z/(l +2e- 4 ), (25 + z)/3}. 
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Hence, m(0,z) = z/3 and, for small positive values of s, m(s,z) = m(0,z) + 2s/3. When 
^ z ^ 1 + 2e~ 4 , m(s,z) — > m(°°,z) = z/(l + 2e~ 4 ) when .? — > °° and this limit is reached 
for s = j^-^z. When 1 + 2e~ 4 ^ z ^ 3, m(s,z) — > m(°°,z) — 1 when s — > °° and this 

limit is reached for s — (3 — z)/2. In both cases, m(°°,z) = m( \ + e~ 4 ,z) hence H(z,°°) — 
//(z,l+e- 4 ). 

Hence, there exists a positive so such that for every z in 7 r and every s in [0, so]* 



H(z,s) 



o lz — *) 



-r^-logl i 



3x 
2x + z 



dx. 



Such a function has a Taylor expansion around 5 = with uniformly bounded coefficient 
over z in For instance, when = 1/2, 



3/2, 9%/3 5/2 + Q( 7/ 2) 

3i V (3z) 3 /2 J + 40z 5 /2 5 + t ^ a h 



4 - 1 5 



where 0{s 1 / 2 ) is uniformly bounded over z in Hence, T = (T e ,T{) fulfills the first con- 
dition to be tempered. 

We now prove that the second part of Definition ^. 3 I holds. Since 7] and T e are independent, 
for every positive integer n, 

Q n (f)=P(7i< l/n)P(f <r e <t + l/n). 

One has 

rcP(f <7;^f+l/n)^4e~ 4 ' when n -> +°°, 

and 

PfT] ^ l/n) = —fi , when n is large enough. 
«° 

Since <2„(f ) is bounded from below by a multiple of 1 /n 1+e , the second point of defini- 
tion 153] holds. 



Case (iii). Here, S e and Si are independent, the distribution of S e is uniform on [0, 1] and 
Si is a continuous random variable with density 



1 



■log 



1 

-4 log 



Si -I 



l{l+2e- 4 0, <3} 



4(5,-1 

with respect to the Lebesgue measure. 
One can choose 

G(z,s)=H(z,s)/H(z,<*>), 

where 

' Z — x 



H(z,s) 



m(s,z) 
(0-z) 



log 



2x 



dx 



z — x 



with 



m{s,z) =min{l,z/(l+2e- 4 ),(2 5 + z)/3}. 
Hence, there exists a positive sq such that for every z in I t and every s in [0, so], 



H(z,s) 



■log 



log 1- 



3x 
2x + z 



dx. 



k (Z-x) 

The Taylor expansion around zero of H(z,s) reads as 

zH(z,s) = (1 - log(3/(4z)))i - s log(s) +o(s log(s)), 
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hence T = (T e ,Ti) does not fulfill the first condition to be tempered. 

Case (iv). Here, S e and S[ are independent, the distribution of S e is uniform on [0, 1] and 
Si is a continuous random variable with density 



1 



■log 



Si - 1 



log 



-I log 



Si -I 



16(5,-1 

with respect to the Lebesgue measure. 
One can choose 

G(z,s)=H(z,s)/H(z,°°), 

where 



l{l+2e- 4 ^ <3} 



H(z,s) = / log 



z — x 
2x 



log 



1 



■log 



z — x 
2x 



dx 



z — x 



with 



m(s,z) =min{l,z/(l+2e- 4 ),(2i + z)/3}. 
Hence, there exists a positive sq such that for every z in I t and every s in [0, so], 



H(z,s) 



1 



o (z — x) 



■log 1 



3x 



log 



- 1 ! (, 3X 
log 1 

4 & \ 2x + z 



dx. 



2x + z y 

The Taylor expansion around zero of H(z,s) reads as 

2z 2 H{z,s) = (3/2 - 31og(3) + 31og(z) +61og(2))s 2 - 3s 2 log(s) +o(s 2 log(s)), 
hence T = (T e , 7j) does not fulfill the first condition to be tempered. 

Appendix B. Proof of Proposition 13. 21 

Notation B.l. Recall that T denotes the Gamma function defined for every positive number 
x by 

"V^e-'df. 



T(x) 

For every real number t, let [t] denote the integer part oft, that is, the largest integer not 
greater than t, and let {t} denote the fractional part oft, hence t = {t} + [t], [t] is an integer 
and {t } belongs to the interval [0, 1). 

For fixed values of the coefficients a, Ji and e,, introduce, for every t > 0, 



Hence, 



and 



where 



M± = / tv^FitvJdv, where F±(v) = V ft (1 - v) a+e - ± y„(l - v) 
J o f~0 

',= [ tv'- l P{V ^v)dv = M±+ [ fv'-'pW ^ v)-F±(v)]dv. 
Jo Jo 

Mf =tB(t, a + 1) ('Y J YiM£ht)P(e i ,t)±y n A{e„,t)P(e n ,t) \ , 



a+e„ 



A(e,/) 



r({f } + q+l) 

r(«- 



M+i 



— — — — ^tt , p( £) r)=n h— 



: + £ + {?} + V ' 



PRIORS FOR THE BAYESIAN STAR PARADOX 



19 



and B denotes the beta function 

T{x + y) 

From the control of the distribution of V, 

n 

M 7 - 7 v 'o < M t < M+ + v' where y = £ | y | . 

i=0 

Combining this with the general expression of Mf given above, one gets 
M t+ i ^ {t + l)B{t + l 7 a + l)x + (t + l) + v t + 1 
M t ^ tB(t,a + l)z-(t)-tf 

where 

n-l 

X±{t) = £ y i A(e i ,t)P(Si,t)±y n A(e n ,t)P(£ n ,t). 

i=0 

Using the fact that 

(t + l)B(t+l,CC + l) t + 1 



tB(t,a + l) t + a + l' 

and that 

, . , „ . , (f + a)(f + a- l)...(f + {a» 
f2?(f,a + l)fi a (0>l> where<2a(0 = - ~ r(a + 1) — > 

one sees that 

M l+i < f + l yo + z+(f + i) + 6a(f + i)v +1 

Furthermore, 

y + ^ + (f + l) + e«(f + l)v{) +1 _ 1 | Z+(f + l)-Z-(0 + g(0v 
» + Z-(0-Qa(Ovo X) + *-(') -rGa(f)Vo 

where jc(f) = vo2a(? + 1) + 7(2a( f ) i s a polynomial function in f . 

From Lemma lBT2l below. there exists a positive number C which depend on the exponents 
a and £,-, ^ i ^ n, only, such that 

Z+C + l) -*-(') < [2y n + e n y]Ct-P, X -(t) > -Cyt^ 1 . 

where 

j3 =min{e„,l +ei}, 1 < j8 2. 

Combining these estimates on % + (t + 1) and £-(f), one sees that there exists finite contin- 
uous functions 0\ and A of the exponents y, a, and £,, such that, for every t > 9u 

R, ^ a/t-A/t 13 . 

Since /3 > 1, there exists 02 such that 2At < a/^ for every t ^ 02- Choosing finally 
= max(0i , 02) yields Proposition l3.2l 

Lemma B.2. Let j3 = min{£„, 1 + £i}. There exists a positive number C, which depends 
on the exponents a and £,■ only, such that 

X+{t + l)-X-(t) [2y n + e„y}CrP, %-{t) > -Cyt^ . 
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Proof of Lemma GO] For every real number t 1 and every 1 ^ i ^ n, 

*-stof)-n*if) ^P(e i ,t)^e- S ^' t \ 

where 

M+i £ M+i £ 2 

Thus, there exists two positive real numbers C[ and such that for every real number 
t > 1, CT < t £i P{£i,t) < C ( + , and one can choose C+ = (a + e, + 3) £i . 

Let C = max{C ( + ; 1 ^ i ^ n}. Using the two relations 

Pfe-,r)-P( et ,r + l)=P( £ ,-,r) a + ^ +r + 2 , 

and 

P(e„,f)+P(e„,f + l)=P(e„,f) ^2- 

one sees that 



a + e„ + f + 2/ ' 



X+(t + l)-X-{t) = 2r n A(e n ,t)P(en,t) - £ ^■A(e i ,f)P(e i ,/ 



. 1 a + Ej + t + 2 



For every 1 < i ^ n, the function A(e, , •) is positive and bounded by 1 . Hence, 

X+{t + 1) - *-(*) < 2 7n P(e n ,t) + £ |^|P( ei ,f) 



, , a + e,- + r + 2 



(2 r „r £ " + 7 e „f-( 1+£ ')) 



and the first inequality in the statement of the lemma holds. The same kind of estimates 
yields 

n-1 

hence the second inequality holds. This concludes the proof of Lemma [B72l □ 
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